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Abstract 

This paper considers the problem of data compression for depen- 
dent quantum systems. It is the second in a series under the same title 
which began with [Hj and continues with [T2|. As in we are inter- 
ested in Lempel-Ziv encoding for quantum Gibbs ensembles. Here, 
we consider the canonical ideal lattice Bose- and Fermi-ensembles. 
We prove that as in the case of the grand canonical ensemble, the 
(limiting) von Neumann entropy rate h can be assessed, via the classi- 
cal Lempel-Ziv universal coding algorithm, from a single eigenvector 
ip of the density matrix p. 



1 Introduction 

This paper continues paper [S| under the same title and extends results es- 
tablished there for grand canonical ensembles to canonical ensembles of ideal 
(free) quantum systems, bosonic or fermionic. This will allow us to anal- 
yse the question of the Bose-Einstein condensation (in the case of a bosonic 
ensemble) and extend results to a number of other integrable models (see 
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The reader is referred to [U] for a general introduction into the subject; here 
we only provide a background formally needed for exposition of the results. 

We consider ideal quantum systems on a cubic lattice Z d . The starting point 
here is a given function u: y G [0, l] d i— > oj(y) (more precisely, a family 
of functions depending on the chemical potential fj,), with non- negative 
values for bosons and real for fermions, describing the energy of normal mode 
y (in a single-particle momentum space [0, An example which we will 
follow closely is where 

^v(y) = \ t 1 - cos ( 27r ^)] - ^ y = (pi. • • • > */<*) e [°» ^ ( L1 ) 



and 



l<l<d 

and /i < for bosons, // 6 R for fermions. Function (jl.ljl determines the 
Fourier transform of the operator — ^A — //I acting in l^i^^) where A stands 
for the discrete Laplacian on Z d . 

Associated with u; M (y) are two important integrals: 

m ^ = i,o # l=Fe-^) dy (L3) 

giving, respectively, the value of the von Neumann entropy rate and the par- 
ticle density, in the thermodynamic limit. (In the left-hand side we put plus 
for bosons and minus for fermions; we will follow this convention throughout 
the paper). Parameter (3 > is the inverse temperature. 

To simplify matters, we will assume that d = 1, though all technicalities 
can be extended to the case of a general d in a straightforward way. 1 The 
free Gibbs grand-canonical Bose- or Fermi-ensemble in a finite volume = 
{0, 1, . . . , £—1} (represented by a segment of the integer lattice Z) is described 
by a density matrix pf acting in the Fock Hilbert space TCf. It is of the form 



1 The issue of Bose-Einstein condensation arises of course for d > 3, unless one uses 
particular boundary conditions. This will be the subject of forthcoming research. 
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where 'Ef = tr n ± exp(—[3Hf) and Hf is the Hamiltonian of the free Bose- or 
Fermi-system in which is the (bosonic or fermionic) second quantisation of 
the single-particle energy operator Hi i in ^(Ae) — C £ . In the above example, 

H eil = ~A e -fiI t (1.4) 

where is the lattice Laplacian in and It the unit matrix in ^(Ag). If we 
impose periodic boundary conditions then the eigenvectors ipe and eigenvalues 
Ki of H^i are naturally labelled by j = 0, 1, . . . , £ — 1: 

<Pi(j) = -^ewfaij/l), Kt{j) = 1 - cos(2irij/£) - fi. (1.5) 

In particular, the eigenvalues He in (|1.5|) are described as uo^{j / 1) where func- 
tion uj was defined in (jl.ljl . 

We retain the notation ipe{j) and for the eigenvectors and eigenvalues 
of a genereal single-particle energy operator H t i in ^(A^), and assume the 
form Ki{j) = LU^j/i) (J = 0, 1, . . .t — 1), where ui^y), is a 'nice' function 
figuring in (jl.lj) and (|1.2|) . See Assumption 11.11 below. 

The condition that Kt(j) is of the form u^j/t) is quite restrictive (although 
it holds in the example of (jl.3|) with periodic boundary conditions). We 
consider it as a first step in studying more general situations. 

Returning to the multi-particle Hamiltonian Hf, its eigenvectors <fii and 
eigenvalues are naturally labelled by occupancy number configurations 
k = (k , . . . kg-i) where entry kj G {0, 1} for fermions and kj G Z + for 
bosons: 

k G {0, if or k G ll. 
Space TiJ has dimension 2 and Tif infinite dimension. 
More precisely, the eigenvectors and eigenvalues of Hf have the form 

#00 = (flM)**) , xf = J2 k Mj)- (i-6) 

Here, subscript S means symmetrisation and A antisymmetrisation of the 

® 

corresponding tensor product ]~[ ipt{j)® kj . 

jeA e 
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In probabilistic terms, the density matrix pj generates a probability distribu- 
tion where an eigenvector (fie(k) 'occurs' with probability ^exp (— /3A^(k)), 



with = ex P ( — /3A^(k)). It is convenient to assign the above probabil- 

ity to occupancy number configuration k i.e. consider a probability measure 
Vf on space 1} (bosons) or {0, 1} £ (fermions) 



As follows from Equation ()1.6|) . the entries Kj, j e A^, of the random config- 
uration K = (Kq, . . . , Ki_i) are independent variables, each with a two-point 
distribution for fermions and geometric for bosons: 



The above independence property is a feature of the grand canonical en- 
semble for free particles. Paper [0] focused on properties of the Lempel-Ziv 
encoding for (the sequence of) probability measures Vi as I — > oo (the ther- 
modynamic limit). The main result of jH] was that if (i) eigenvalues n^{j) 
of the single-particle energy operator Hg i are of the form uj(j/£) (which is 
the case in the example under consideration) and (ii) function u satisfies a 
certain condition (see Assumption 1 from Section 2 of jB]), the Lempel-Ziv 
parsing of the random string K = (K , . . . , Kp_i) indentifies integral ()1.2|) as 
the data compression limit. 

Replacing the grand canonical with the canonical ensemble means fixing 
values ne, i = 1,2, ... for the number of particles. Probabilistically, we 



p±(k) = ^exp (-/3A*(k)). 



Vf{Kj = k) = (1 t e-P'M^e-W'M), 




(1.7) 



have to pass to conditional distributions V l 



V, 




which do not have the independence property. 

It is convenient to specify the integrands in ()1.2|) and ()1.3|) as 




0<y<l, (1.8) 



and 



(1.9) 



1 e -^f.(y) ' 



0<y<l, 
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and interpret the values lp Aj/P) and (j / '£) as the mean and entropy of 
the marginal probability distribution (two-point or geometric) of variable Kj 
in (jl.7|) . j G Ag. [Of course, these values can be expressed in terms of each 
other.] Here, in agreement with we set: 

w M (y) = wo(y) - //, 

although a more general form of dependence can be considered. We also 
assume that uq is a non-negative function for bosons and real for fermions. 

Observe that while parameter \x figures in the definition of the measure Vf, 
it does not in that of the conditioned measure Vf„ . 

From now on, we state conditions that we need in terms of the mean-value 
functions IpAy), and the whole exposition is purely probabilistic. We follow 
the notation and definitions fron Sections 1 and 2 of jSj. 

Assumption 1.1 Functions in M.ty) take 

ye[0, l]^+»e(o,oo) 
y e [0, 1] » ifjy) e (0, 1) 

are continuous in y and for all (3 > and fi < for bosons and \i e R 
for fermions. Moreover, for all (3 > 0, the function fi i— > rrip is monotone 
increasing with \i so that, for all r in the range of this function, there exists 
a unique fj, such that nr^ = r. Here, and below 

J[0,1] 

Furthermore, functions 

ye[0,l]^g^(y)e(0,oo) 
are continuous in y for all j3 > and fi < for bosons and fi G R for fermions. 



Assumption 1.2 Sequence of particle numbers n£ satisfies: 



1 

71 



'lid 



for some r from the range of function \i \— > m 



± 

Am - 



Assumptions 11.11 and 11.21 are henceforth presumed to hold. The main result 
of the paper is the following: 



Theorem 1.3 Consider a triangular array of independent random variables 
Kj , < i < £ — 1, £ — 2, 3, . . . ; (either geometric or 0,1-valued), where 



Kf has mean l^Aj/£) and entropy gf^{j/£)- From define random 

variables Yj e \ distributed as Kj 



(Eo<k<-i k T = ntj ■ 



Then for all (3 > 0, in probability, almost surely and in mean the number of 
words C(Y^) in the Lempel-Ziv parsing of the string Y^ = {Y 1 , . . . } 
satisfies: 



lim 



-C(Yt 



[0,1] 



9p,p(v)dy, 



(1.12) 



where fj, is the unique value for which rrip = r. Here, almost surely is 
understood with respect to the product-measure XiV^ n (see Jffl). 

The almost sure form of convergence is the most subtle, so we treat it as 
principal. The logic behind the proof is as follows. By Shannon's Noiseless 
Coding Theorem (see for example Theorem 5.3.1 of the limit of the 
Shannon entropy of probability distributions Vf n 



lim \h I jr ,...,Xi_, 



E K 



o<i<e-i 



(if it exists) represents the data compression limit for the sequence of canon- 
ical ensemble distributions V in . In terms of the Lempel-Ziv encoding, re- 
peating arguments from Chapter II of JT] yields: 

Lemma 1.4 Almost surely with respect to xV^ n : 



1 D 1 

liminf -^C(Y W ) > liminf -H ( K , . . . , K £ -i 



E K < 



1.13) 



0<i<^-l 



On the other hand, we establish two lemmas: 
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Lemma 1.5 Almost surely with respect to xVg n : 

limsup ^C(YW) < fc± = lim iff (K , . . . , K^) , (1.14) 

i/ie averaged Shannon entropy of probability measure Vg, where fi has been 
specified in Theorem \l.!A 

Lemma 1.6 The entropies obey the bound: 
H [ff ,...,ff|-i Ki = n\ -H{K ,...,K i „ 1 )>8{l,n l ), (1.15) 

\ 0<i<f-l / 

where 

Um-M,n t ) = Q, (1.16) 

Together (|1.13|) - ([1.16|) imply (|1.12|) . In Sections El and El we develop an 
argument that proves Lemma fl. 51 and in Section 0] we prove Lemma fl. 61 To 
simplify the notation, we will often omit superscripts ± and subscripts /3, \x. 



2 Properties of the typical set 

Throughout the rest of the paper, Assumptions 11.11 and 11.21 are presumed 
valid. In this section we concentrate on the fermionic case of 0, 1- valued 
variables - the proofs adapt to the geometric case as in . As in jH] , we will 
use the idea of a typical set - firstly in Proposition 12.11 we shall show that 
the added restriction of being in the typical set forces extra useful properties 
to hold. Then in Proposition 13.51 we shall show that we will 'nearly always' 
be in the typical set, so we can exploit these extra properties. 

We write e a for the entropy of the random variable under consideration (ge- 
ometric or 0, 1-valued) with mean a. Given e G (0, 1) and integer M and j, 
define the typical set of realisations 

( j+M-i ~\ 
lj% = I k : & - KV^)) < Me '\i where e ' = ee ^/( 2L ) ( 21 ) 
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and L = supo^j^j/). 

Suppose the rth word in the Lempel-Ziv parsing begins at t(r), has length 
s(r) and ensemble-entropy defined to be E^(r) = ^l=tM 1 9{ u l$)- We set 
AT = JV(YW) = {t(r) : E^(r) < log£(l - e)} (the set of start-points of low 
ensemble-entropy words). For any sequence kwe can write: 

N C {t(r):EW(r)<log£(l-e),keT$ )Ar) } 

U{*( r ) :k ^?), W }- (2-2) 

We bound the size of the first set in Proposition ^. 11 to find that it is less than 
K\l l ~ e , since these parsed words are short distinct strings in the typical set. 



Proposition 2.1 Given e, ifl(y) is uniformly continuous on [0, 1] and bounded 
above by L there exists a constant K\(e) such that: 

# {r : E«(r) < log*(l - e), k G 7$^} < J^ 1 "* 8 . (2.3) 

Proof We can find a finite number of intervals J, in which our variables 
have their means close together. The key property is that l(y) is (uniformly) 
continuous, so given e, we can calculate iV = N(e) and u\, . . . Un with u% = 0, 
un = 1 such that for i — 1, . . . N — 1: 

sup |Z(a;) - %)| < e'. 
x,ve[uj,tti+i] 

where e' is from Equation (|2.1|) . Define Jj = {j : j/£ G Lj = 

sup xeJt /(a;). 

As in jHj, we compare with a set which we can count and control 

more easily. Given a > and integer M, define: 

V aM = jxf = ( Xl , ...,x M )E {0, 1} M :J2 Xi <M(a + ^)\. (2.4) 

For xf f G T> a>M , writing P a for product measure for independent 0, 1-valued 
random variables with mean a: 

P a (x) = exp f M log(l - a) + logaj^x; \ > exp(-Me a (l + e)). 
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If k G yfy, where j G Jf. 



j+M-l j+M-1 
u=j u=j 

< M(L i + 2e')<M(L i + ee L ./L i ) 



so the sub-string kj +M 1 = (kj, . . . kj + M-i) £ Dl^m- We therefore know 
that if k G T$ )m+S{jyi and s(j) < (log*)(l - e)/ef = M(j) then 

¥ L . (k t{j) , . . . , > IPl, (%), • • • , fct(j) + M0)-i) 

> exp(-M(j)e Li (l + e))=£ 1 -* 2 . 

Since these finite strings are distinct, the number of strings in Jj such that 
these two conditions hold is less than . Summing over intervals Jj, the 
total number of such strings is less than £ l ~ e N. □ 



3 Negative association and the typical set 

In this section, we develop necessary technical tools to work with the canon- 
ical Gibbs distribution, and then finish the proof of Lemma 11.51 The key 
property that we shall use is that our variables K are negatively associated. 
That is, under the condition that YliZo = n t-> since fQ are non-negative, 
if one variable is large, then the others are forced to be smaller. Formally: 

Definition 3.1 A collection of real-valued random variables {Uu) is nega- 
tively associated (NA) if the covariance 

Cov (fiUt :ie A), g(U r .jeB))<0, 

for all increasing functions f and g, taking arguments over disjoint sets of 
indices A and B. 

We require a result that gives a class of variables with conditional distribu- 
tions that are negatively associated. This comes via the idea of logarithmic 
concavity: 



9 



Definition 3.2 A random variable V taking values in Z + with probabilities 
p(s) = F(V = s) satisfies logarithmic concavity (LC) if for all s > 1, p(s) 2 > 
p(s - l)p(s + 1). 

Notice that the 0, 1- valued and geometric distributions have this property. 
This is sometimes referred to as Newton's inequality (see Niculescu [Z]). Fur- 
ther, we can use the following fact, going back to Hoggar (see 0j). 

Theorem 3.3 IfV and W are independent LC random variables, then their 
sum V + W is also LC. 

We also rely on a result of Joag-Dev and Proschan [H] , which is itself based 
on Efron [3]. We reproduce its proof here since the proofs in [3] and [5] only 
describe the case of random variables with densities. 

Proposition 3.4 Let Vi, i = 1,2... be independent Z + -valued LC random 
variables, with sum Se = $^i=i^»- Then for any £ and n, the conditional 
variables W% ~ (Vi\Si — n), i — 1, . . .1, form an NA family. 

Proof First, we establish an assertion similar to the main theorem of 3 : 
if Vi, ... 14 are LC random variables then for any increasing function the 
conditional expectation 

E (0(Vi, . . .Vt)\Si = s) is an increasing function of s. (3.1) 

We prove (|3.1|) by induction on £. 

By log-concavity, p 2 (s + 1 — x)/p 2 (s + 1 — y) < p 2 (s — x)/p 2 (s — y) for integer 
< x < y < s. Then for any 1 < t < x — 1: 

Y, t x=oPl(x)p2(s+ 1 -X) _ Y!x=oPl{x){P2{.S + 1 - X)/ P2 (S + 1 - t)) 

E S x = t Pi(x)Ms + 1 - x)/p 2 (s + l-t)) 

EL =0 Pl(^)(P2(g -x)/p 2 (s-t)) 
52 S x=t Pl( X )(P2{s - x)/p 2 (s - t)) 
J2 t x=0 Pl( X )P2(s-x) 

T, S x=tPi( x )pi( s - x )' 

Now, since a/b < c/d implies that a/ (a + b) < c/(c + d), this gives us that 
for any t, F(Vt < t|Vl + V 2 = s + 1) < P(T4 < t\V\ + V 2 = s), which implies 
Equation for £ = 2. 



Yl x ttPi( x )P2(s + 1 - x) 

< 
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For t > 2 and an increasing function define 

$(t, u) = E (0(^i, . . . V t )\T = t, V t = u), where T = Ztl V i- 

We know that $ is increasing in t by the inductive hypothesis for Equation 
(j3.1|) for I — 1, and in u by the monotonicity of (p. Then 

E ...Vi)\S e = 8) = E($(T, ^)|T + V, = s), 

which is increasing in s by the inductive hypothesis for i = 2. This concludes 
the proof of (|3.1j) . 

Next, as in [5j, we use ()3.1|) . relying on two further results. Firstly, Cheby- 
shev's rearrangement Lemma: for F + increasing and F_ decreasing, 

Cov (F + (X), F-(X)) < (3.2) 

(as EF + (X)EF_(X) - EF + (X)F_(X) = (p{i)p(j)F + (i)F„(j) 
-p(i)p(j)F + (i)F„(i)) = T. 3<l P{i)p{j){F + {i) - F+(j))(F_(j) - F_(i)) > 0, 
where i,j G Z + ). 

Secondly, by expanding with conditioning, for any random variables U, V, W: 

Cov (U, V) = ECov (U, V\W) + Cov (E(U\W),E(V\W)). 

Now, taking U = f{V h i G A)\S t , V = g{V 3 , 3 e and W = (Sa^b) = 

(SigA ^ij SjGB ^i) wnere A,Bc{l,..i} are disjoint sets: 

Cov(/,0|S) = HCov(f,g)\S,S A ,S B ) + Cov(E(f\S A ,S B ),E(9\SAiS B )\S). 

The first term is zero. As for the second term; as S A increases, 13. 1|) implies 
that E(/|5a) increases. At the same time, since Sa + S b = St, S B decreases, 
so again by (j3.1J) . E(<7|Sb) decreases, so we can apply (|3.2j) . This completes 
the proof of Proposition 13.41 □ 

In Proposition 13 . 51 we will work with a general family of Z + - valued variables, 
but assume that functions I and (7 satisfy Assumption 11.11 

Proposition 3.5 Consider a triangular array of Z + -valued random vari- 
ables Ya , j G = {0, . . . £ — 1}, with Y. forming an NA family for each 
t. Assume Y^ have mean l(j/£) and entropy g(j/£) and a uniform bound 
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on their centred fourth moment: ~E(Y- — l(j/£)) 4 < b. Then for any e and 
any rj G (0, 1), there exists a constant K 2 = K 2 (e,r],b) such that for any £ 
large and for C the number of words in the Lempel-Ziv parsing: 



P 



(^E J (Y^W s(r) )>^ <g§. (3.3) 



Here C Z £ + is defined in / TO) . 

Proof By NA, for any i ^ j ^ k ^ m from A^: 






> 






- ^)) 3 , 





> 


E( yW 


-l(i/l))(Y} e) 







> 


E(Yf 






Hence, 


for 


any set AC A?: 





e ( £ (if - w< 

£ (if -«»■/*))*+ 3 -'«/«)* (if-'w/o)' 



< 

< 3b\A\ 2 . 

Define Z r = I(Y^ ^ s ( r ))- Note that by Chebyshev's inequality, for any 
s: 



3b 



- (se) 4 " s 2 e' 4 " 



Hence, it is sufficient to show that J2 r =i s ( r ) 2 /C ^ almost surely, so that 



if E c = J2r=i EZ r, then E c /C -> 0. (3.4) 

However, the counting argument described in Chapter II of ^1] shows that 
this will hold. Specifically, in the finite alphabet case, the number of possible 
parsed words of length less than (log£)/100 is clog££ 1 ^ 100 . In the infinite 
alphabet case, we can truncate as in jB]. 
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Then, by NA, writing Zj for a random variable with the same marginal 
distribution as Zj y but independent of the other Z^. 

P {^{Zr - EZ r ) > vC^j < P i^yZr - EZ r ) > uC^j < exp (-2CV 2 ) 

by Hoeffding's inequality (see for example Note 2.6.2 of P). 
Hence, writing r\C = vC + E c 

P (|> > ,c) < exp (_2C(, - EdCf) < ^ ' Ec/C)i , 

so by (|3.4j) . we are done. □ 
We can now complete the proof of Lemma 11.51 

Proof of Lemma 11.51 We need to bound the \N\, by controlling the two 
sets in the RHS of (|2.2|) . Proposition 12. II gives that the size of the first set is 

o(f- e2 ). 

Proposition 13.41 shows that our variables have the NA property and hence 
we can apply Proposition 13.51 to the size of the second set in the RHS of 
(|2.2J) . Specifically, if C(Y^) grows more slowly than linearly in £/\og£, 
then Theorem 11.31 holds. Otherwise, the upper bound 0(1/ C 2 ) provided 
by of Proposition 13.51 becomes summable in £. Hence by the Borel-Cantelli 
Lemma, almost surely the proportion of words in the second set becomes 
smaller than r], for any rj. Overall then, the \N\ \og£/£ — > 0. 

Then considering the entropy present in the parsed words, we deduce that: 

i 

]>>(^) = £ E( V) > log*(l - c)(C(YW) - \N\). 

j=l r 

On rearranging we obtain that 

limsup !^C(YW) < ^ff(jyf) + e. 



e^oo ! £ 



□ 
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4 The von Neumann entropy of the ensemble 



One might expect that the conditioning might have only a small effect on 
the entropy of the ensemble, and that hence the entropy of the canonical 
and grand canonical ensembles will be very close to one another. This is 
confirmed in this section, in which we prove Lemma fl. 61 

In the IID case of 0, 1-valued random variables, it is clear that Assump- 
tion 11.21 is the right condition, since there the conditioned variables Yj ~ 

[Kj^\ Ylo<i<e-i = n t) ls equiprobable on the (^) possible values, so 



H\K 0l ...,K l 



Ki = nA -H(Kv,...,Kt_. x ) 



log -£(-plogp-(l-p)log(l-p)) 

\ n tJ 

— — p) — >• I) as £ — ► oo. 



(using Stirling's formula, and expanding in a series in rie/£ close to p). 

In the non-IID case we exploit a relationship between the mode and mean 
described by Bottomley PQ. Specifically, for a unimodally distributed random 
variable S: 



|mode(S) -ES| < V3VarS. 

Hence if S is the sum of n 'approximately IID' variables, we expect that the 
mean will be of the order of n, and within sfn of the mode. 

As elsewhere in this paper (and in j^J), we will use the fact that along small 
enough intervals the variables are 'nearly IID'. That is, by continuity of the 
mean-value function /, we can find intervals such that swp yyleI . \l(y) — l(y')\ 
is arbitrarily close to zero. 

Lemma 4.1 Given e > 0, we can find a partition of [0, 1] by intervals Ij = 
[v,j,Uj+i] such that defining 

= i v im 
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then 

lj< lim^/(l + A:*,)<^(l + e). 

where lj = f T l(x)dx 

In the case of 0, 1- variables, we require an extra statement, as follows (the 
case of geometric variables is actually simpler, and discussed at the end of 
the section). 

Lemma 4.2 For the sum S^ M ' of independent 0, 1-variables X±, . . . Xm, with 
EXj = pj, define k* M = (j2iLiPi/0- ~ Pi)) / M > then: 

P(SW = n - 1) n 
P(S(A0 = n) ~ (M-n + l)k* M 

Proof Following Niculescu [? . write E{ for the elementary symmetric func- 
tions of degree i, and Si = Ei/(f) for the averaged version. By Newton's 
inequalities, discussed in [7j, the ratio £ n _i/£ n > £q/£\. Hence, we deduce 
that, writing Uj = pj/(l — pj): 

F(S^=n-l) _ E n -i(m, . ..u M ) _ ( n A ! 1 )£ n -i(u 1 ,...u M ) 

E n {ui, . . . u M ) (™)£ n (ui, . . . um) 

n£ n -i(ui, . . .u M ) 
(M — n+ l)£ n (ui, . . . u M ) 

n£ (iti, . ..u M ) n 

(M -n + l)£i(u h ...u M ) ~ (M-n + l)k* M 

□ 

We will now use a local lattice Central Limit Theorem, the main result of 
Petrov jS] (see Theorem 14.41 below). Note that this result extends a very sim- 
ilar one of Prohorov ^U], which only holds for uniformly bounded variables, 
thus ruling out the geometric case. 

Assumption 4.3 Consider integer-valued random variables X 1 , X 2 , . . . sat- 
isfying the following conditions: 



P(£( M ) = n) 



> 
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1. The highest common factor of the integers j such that 



^fX)P(Jf i = 0)P(X i = j)j - oo 



log 



is 1, where for each X i} by shifting, P(Xj = j) has its largest value at 

J = o. 

2. As n — > oo, cr^ — > oo ; and sup m L m cr m < oo where a 2 = J27=i Var -X 



and 



1 - 

L n = — ^E|X,-EX;| 3 . 



(4.1) 



Theorem 4.4 (Petrov) Lei Xi,X 2 , . . . be independent integer-valued ran- 
dom variables and write = Y^i=xXi, a-n = ES 1 ^ and a 2 = Var S^. 
Then under Assumption \4-5\ there exists a constant C such that for all qinL: 



a n F(S n = q) = exp 

V Z7T 



(g ~ Qn)^ 
2<7* 



< CL n , 



where L n is defined in \4-l\) above. 



To apply Theorem 14.41 we need to check that Assumption 14.31 2 holds in 
both the fermionic and bosonic case. For a 0, 1-variable X with EX = p, 
E|X — p\ 3 = p(l — p)(2p 2 — 2p + 1) < 2Var X. Hence in the fermion case, 
variables K , . . . satisfy: 

®\Ki - EKi\ 3 < c Y VarXi (4.2) 

0<i<l-l 0<i<£-l 

with c = 2 and Assumption 14.31 2 holds. 

For X geometric, we can distinguish two cases, (a) Parameter p < 1/2, so 
that EX = p/(l - p) < 1 and E|X - EX| 3 = E(X - EX) 3 + 2EX 3 P(X = 
0) = p(l + p + 2p 2 - 2p 3 )/(l - p) 3 < 7Var X. (b) Value p > 1/2, so 
EX > 1 and E|X - EX| 3 < EX 3 + EX 3 = (p + Ap 2 + 2p 3 )(l - p) 3 < 
7/(1 - p) 3 < 28EXVar X. In either case the bound is fulfilled with 
c = 28L = 28maXjEXj and again Assumption 14.31 2 holds. 

Proof of Lemma 11.61 Again, let us first consider the case of 0, 1-valued 
variables. We again break the interval [0, 1] down into a collection of smaller 
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ones as described in Lemma 14.11 such that the variables are approximately 
IID on each interval. Refer to the sum over interval Ij as Uj, and set pj = 
Further, for technical reasons, we insist that the point 1/2 occurs at 
the boundary of an interval (so no interval contains points where pj > 1/2 
and pj < 1/2). 

Writing for J2o<i<e-i ^i, we formally expand H(K , . . . , Ke-\\S^ = 
rii) - H(K , . . . ,K e _i) as 

k +...k e _ 1 =n e ^ l ' \ 0<i<£-l 

0<i<£-l k f 

= E E ( n. Kt = » - ^ = = » - ) io g P ( ., = (4,) 

0<i<*-l fej V V ' / 

+ logP(5 w =n € ), (4.4) 
where sf = E^i^i = ^ W " 

We deal with each of ()4.3j) and ()4.4j) separately. The second term, (|4.4jl . is 
bounded directly using Theorem 14.41 since (ri£ — r) 2 / (J2o<i<e-i ^ ar -^i) ~~ * 
as £ -> oo (see lO|) ). 

Next, we split the sum over i up into sub intervals: given a value z, where 
if I G Ij, we define 

l/f = £ ^,£/g = £ K r = \jf-K % ^= K r = S^-Uf\ 

r:r/i£lj r:r/lGlj,r^i r:r/l£Ij 

Then the first term, Equation (|4.3J1 can be rearranged to give the sum over 
i of: 

¥{uf = t)F(S? = n-t) ( P(Z7g =t-k ) 

xF(Ki = k)\ogW(Ki = k). (4.5) 
Now, for any value of t, this inner sum can be rearranged to give: 

/ FCUf] = t - l)/P(Z7fJ = t) - 1 \ 

log - 1 ft 1 " Pi) 1 hl J' 1 h ' • (4-6) 

\l-p i +p i W(ufj^t-l)/¥(ufJ^t)J 
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— CO 

Notice that if t is close to the mode of U ^ , then this is close to zero. Hence, 
if the mean and mode are 'close together' (as [T] ensures), we can produce 
sensible bounds, using Lemma l4~2l 

If Pi < 1/2, log(l/pi — 1) is positive, and so < (1 — Pi)pi log(l/pj — 1) < 1. 
Writing iV for the number of summands in , Lemma 14.21 implies that 

W>(ufj = t-l)/¥(ufj =t)> t/(k*(N-t)), so since f(v) = (v-l)/(l-p+pv) 
is an increasing function in v , we can deduce that ()4.6|) is at least: 

t(k* + 1) - k*N > (1 + k*) 2 / k*N \ > 4 ^ fc*JV 



&*iV(l - p) - t(k(l - p) - p) ~ k*N V k* + lJ~N\ k* + 1 
where the first inequality follows by concavity in t. 



rti I pro- 



Thus we control (J4.3|) through a bound on E f — E[/j^ 
vided by Theorem 14.41 That is, we know that Uf \ Sf and are all close 



to normal, so the expectations will be close to the values they take in the nor- 
mal case. That is, writing Z\, Z 2 and Z% for normal variables, with densities 
01, (f>2 and 03 of mean and variance matching Uf \ and S respectively: 

ynpf ] = y)F(S J =m-y) yUv)^^ ~ y) 



< 



E 

y 



|yP(£/f = y)||P(^f = ni - y) - Mni - y)\ 



\y<p 2 (n e -y)\\F(uf = y)-Mv)\ 

2s f(s® = n e ) 

y 

+ L P(gw ~ nt)Mnt) \ns { >=««)- Mm) 



< m * = - ) (^ar Ijf + \n t - ESf ] \ + yVar S f + ^Var £/f >) 

where e is the largest of the bounds given by Theorem 14 A\ and hence is 
0(l/£). 
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The result follows, using the fact that if Zj are independent iV(/Zj, of) random 
variables (for i = 1,2) then Zi\Zi + Z 2 = [i\ + ^2 + r has a iV(/ix + rof /(of + 
of), ofo|/(of + erf)) distribution. 

For pi > 1/2, the same argument works, only replacing the lower bound 
on P(n — 1) /P(n) from Lemma T4.2I with the corresponding upper bound that 
P(n-1)/P(n) < l*n/(N-n+l), where/* = (J20--Pi)/Pi)/ N - The argument 
goes through in a similar way. □ 

We can use a similar idea in the case of geometric distributions. Again we 
can bound the term (|4.4|) using Theorem 14.41 Then the first term, Equation 
(|4.3jl can be rearranged to give the sum over i of: 

\- g(gg = tMS 3 =n-j r /_. v , P(X t = x)P(^ = t-a;) \ 

(4.7) 

which is the same term previously bounded, and the same arguments apply. 
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